382 research outputs found

Genephony: a knowledge management tool for genome-wide research

Author: A Kasprzyk
Alberto Riva
Angelo Nuzzo
B Giardine
G Dennis Jr
KG Becker
L Stein
S Philippi
Publication venue: BioMed Central
Publication date: 01/01/2009

Crossref

Springer - Publisher Connector

PubMed Central

A Formal Approach to Support Interoperability in Scientific Meta-workflows

Author: B Giardine
E Bertin
G Castelli
G Terstyanszky
Gabor Terstyanszky
Giuliano Taffoni
J Kranjc
J Sroka
Junaid Arshad
Noam Weingarten
S Herres-Pawlis
T Oinn
Tamas Kiss
WM Van Der Aalst
Y Toda
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016

Scientific workflows orchestrate the execution of complex experiments, frequently using distributed computing platforms. Meta-workflows are an emerging type of such workflows that reuse existing workflows, potentially from different workflow systems, to support more complex experimentation while minimizing workflow design and testing effort. Workflow interoperability plays a central role in achieving this objective. This paper focuses on fostering interoperability across meta-workflows that combine workflows of different workflow systems from diverse scientific domains. This is achieved by formalizing the definitions of a meta-workflow and its different types, in order to standardize the data structures used to describe workflows that are published and shared via public repositories. The paper also includes a thorough formalization of two workflow interoperability approaches based on this formal description: the coarse-grained and the fine-grained workflow interoperability approach. The paper presents a case study from astrophysics that demonstrates the use of meta-workflows and workflow interoperability within a scientific simulation platform.

Crossref

UWL Repository

OA@INAF - Istituto Nazionale di Astrofisica

WestminsterResearch

VariVis: a visualisation toolkit for variation databases

Author: B Giardine
B Staats
C Béroud
C Kanz
CR Scriver
DA Benson
GP Patrinos
IFAC Fokkema
JE Stajich
JT den Dunnen
M Claustres
P Riikonen
RGH Cotton
Richard GH Cotton
SM Maurer
Timothy D Smith
Publication venue: BioMed Central
Publication date: 01/04/2008

Background: With the completion of the Human Genome Project and recent advancements in mutation detection technologies, the volume of data available on genetic variations has risen considerably. These data are stored in online variation databases and provide important clues to the cause of diseases and potential side effects or resistance to drugs. However, the data presentation techniques employed by most of these databases make them difficult to use and understand. Results: Here we present a visualisation toolkit that can be employed by online variation databases to generate graphical models of gene sequence with corresponding variations and their consequences. The VariVis software package can run on any web server capable of executing Perl CGI scripts and can interface with numerous Database Management Systems and "flat-file" data files. VariVis produces two easily understandable graphical depictions of any gene sequence and matches these with variant data. While developed with the goal of improving the utility of human variation databases, the VariVis package can be used in any variation database to enhance utilisation of, and access to, critical information.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

University of Melbourne Institutional Repository

WeBIAS: a web server for publishing bioinformatics applications

Author: A Papanicolaou
B Giardine
B Néron
B Wilczyński
Bartek Wilczyński
Bogdan Lesyng
J Ren
P Daniluk
Paweł Daniluk
PJA Cock
T Oinn
Publication venue: 'Springer Science and Business Media LLC'

Crossref

The UCSC Genome Browser Database: 2008 update

Author: Baertsch R.
Barber G. P.
Clawson H.
Diekhans M.
Giardine B.
Harte R. A.
Haussler D.
Hinrichs A. S.
Hsu F.
Karolchik D.
Kent W. J.
Kober K. M.
Kuhn R. M.
Miller W.
Pedersen J. S.
Pohl A.
Raney B. J.
Rhead B.
Rosenbloom K. R.
Smith K. E.
Stanke M.
Thakkapallayil A.
Trumbower H.
Wang T.
Zweig A. S.
Publication date: 02/08/2017

The University of California, Santa Cruz, Genome Browser Database (GBD) provides integrated sequence and annotation data for a large collection of vertebrate and model organism genomes. Seventeen new assemblies have been added to the database in the past year, for a total coverage of 19 vertebrate and 21 invertebrate species as of September 2007. For each assembly, the GBD contains a collection of annotation data aligned to the genomic sequence. Highlights of this year's additions include a 28-species human-based vertebrate conservation annotation, an enhanced UCSC Genes set, and more human variation, MGC, and ENCODE data. The database is optimized for fast interactive performance with a set of web-based tools that may be used to view, manipulate, filter and download the annotation data. New toolset features include the Genome Graphs tool for displaying genome-wide data sets, session saving and sharing, better custom track management, expanded Genome Browser configuration options and a Genome Browser wiki site. The downloadable GBD data, the companion Genome Browser toolset and links to documentation and related information can be found at: http://genome.ucsc.ed

RERO DOC Digital Library

Characteristics of transposable element exonization within human and mouse

Author: A Athanasiadis
A Corvelo
A Gerber
A Goren
A Levy
A Magen
A Nekrutenko
A Resch
AFA Smit
Agnes Hotz-Wagenblatt
B Giardine
B Mersch
BR Graveley
Britta Mersch
C Liu
D Karolchik
D Labuda
DD Kim
E Kim
ES Lander
EY Levanon
G Ast
G Lev-Maor
Gil Ast
H Xie
Ilya Ruvinsky
J Hull
J Jurka
JO Kriegs
JO Yang
JP Nemes
K Nakabayashi
KP Kister
L Lin
M Amit
M Blow
M Krull
M Moller-Krull
M Roy
M Sironi
MA Batzer
MD Koob
N Gal-Mark
N Sela
NH Gehring
Noa Sela
O Ram
P Deininger
PL Deininger
R Cordaux
R Sorek
RA Gibbs
RE Mills
RH Waterston
RM Kuhn
RT Hillman
S He
S Schwartz
SK Ng
SS Singer
ST Sherry
T Kwan
VV Kapitonov
W Makalowski
WJ Kent
WL Chen
WS Lo
XH Zhang
Y Xing
YF Chang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/06/2010

Insertion of transposed elements within mammalian genes is thought to be an important contributor to mammalian evolution and speciation. Insertion of transposed elements into introns can lead to their activation as alternatively spliced cassette exons, an event called exonization. Elucidation of the evolutionary constraints that have shaped fixation of transposed elements within human and mouse protein coding genes and subsequent exonization is important for understanding of how the exonization process has affected transcriptome and proteome complexities. Here we show that exonization of transposed elements is biased towards the beginning of the coding sequence in both human and mouse genes. Analysis of single nucleotide polymorphisms (SNPs) revealed that exonization of transposed elements can be population-specific, implying that exonizations may enhance divergence and lead to speciation. SNP density analysis revealed differences between Alu and other transposed elements. Finally, we identified cases of primate-specific Alu elements that depend on RNA editing for their exonization. These results shed light on TE fixation and the exonization process within human and mouse genes.

arXiv.org e-Print Archive

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Clinically relevant updates of the HbVar database of human hemoglobin variants and thalassemia mutations

Author: Giardine B. (Belinda)
Hardison R.C. (Ross)
Joly P. (Philippe)
K Chui D.H. (David H.)
Patrinos G.P. (George)
Pissard S. (Serge)
Wajcman H. (Henri)
Publication venue: 'Oxford University Press (OUP)'
Publication date: 08/01/2021

HbVar (http://globin.bx.psu.edu/hbvar) is a widely used locus-specific database (LSDB) launched 20 years ago by a multi-center academic effort to provide timely information on the numerous genomic variants leading to hemoglobin variants and all types of thalassemia and hemoglobinopathies. Here, we report several advances for the database. We made clinically relevant updates of HbVar, implemented as additional querying options in the HbVar query page, allowing the user to explore the clinical phenotype of compound heterozygous patients. We also made significant improvements to the HbVar front page, making comparative data querying, analysis and output more user-friendly. We continued to expand and enrich the regular data content, involving 1820 variants, 230 of which are new entries. We also increased the querying potential and expanded the usefulness of the HbVar database in the clinical setting. These additions, expansions and updates should improve the utility of HbVar both for the globin research community and in a clinical setting.

Erasmus University Digital Repository

QuickNGS elevates Next-Generation Sequencing data analysis to a new level of automation

Author: B Giardine
D Kim
H Li
J Feng
J Rynes
M Reich
MA Kallio
Miloš Nikolić
MR Friedländer
P Cingolani
P Machanick
Peter Frommolt
Prerana Wagle
S Anders
S Durinck
S Griffiths-Jones
T Hubbard
T Rausch
WJ Kent
Publication venue: 'Springer Science and Business Media LLC'

Crossref

WordCluster: detecting clusters of DNA words and genomic elements

Author: A Sandelin
A Siepel
AR Quinlan
B Giardine
D Durand
D Karolchik
Guillermo Barturen
José L Oliver
KD Pruitt
M Ashburner
M Gardiner-Garden
M Hackenberg
Michael Hackenberg
P Carpena
Pedro Bernaola-Galván
Pedro Carpena
R Aloni
R Lister
TJ Hubbard
VJ Makeev
Ángel M Alganza
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: Many k-mers (or DNA words) and genomic elements are known to be spatially clustered in the genome. Well-established examples are genes, TFBSs, CpG dinucleotides, microRNA genes and ultra-conserved non-coding regions. Currently, no algorithm exists to find these clusters in a statistically comprehensible way. The detection of clustering often relies on densities and sliding-window approaches or arbitrarily chosen distance thresholds. Results: We introduce here an algorithm to detect clusters of DNA words (k-mers), or any other genomic element, based on the distance between consecutive copies and an assigned statistical significance. We implemented the method in a web server connected to a MySQL backend, which also determines the co-localization with gene annotations. We demonstrate the usefulness of this approach by detecting the clusters of CAG/CTG (cytosine contexts that can be methylated in undifferentiated cells), showing that the degree of methylation varies drastically between the inside and the outside of the clusters. As another example, we used WordCluster to search for statistically significant clusters of olfactory receptor (OR) genes in the human genome. Conclusions: WordCluster seems to predict biologically meaningful clusters of DNA words (k-mers) and genomic entities. The implementation of the method as a web server is available at http://bioinfo2.ugr.es/wordCluster/wordCluster.php, including additional features such as the detection of co-localization with gene regions and an annotation enrichment tool for functional analysis of overlapped genes.

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Repositorio Institucional Universidad de Granada

ORegAnno: an open-access community-driven resource for regulatory annotation

Author: A. Ticoll
B. Bernier
B. Chu
B. Giardine
B. Hooghe
Blanco
C. M. Bergman
C. Wadelius
D. Vlieghe
E. Blanco
E. Portales-Casamar
G. Robertson
Ghosh
Glenisson
Harbison
Ho Sui
I. J. Donaldson
Jiang
K. Kasaian
Kelso
Kim
M. Bilenky
M. C. Sleumer
M. Griffith
M. Haeussler
M. S. Halfon
Macisaac
Matys
Montgomery
O. L. Griffith
P. De Bleser
P. Van Loo
Portales-Casamar
R. Hardison
Ren
Robertson
S. Aerts
S. B. Montgomery
S. J.M. Jones
S. Lithwick
S. M. Gallo
S. Mahony
Sierro
Tompa
Trinklein
Vlieghe
W. Wasserman
Wasserman
Publication venue: Oxford University Press
Publication date: 01/01/2008

ORegAnno is an open-source, open-access database and literature curation system for community-based annotation of experimentally identified DNA regulatory regions, transcription factor binding sites and regulatory variants. The current release comprises 30 145 records curated from 922 publications and describing regulatory sequences for over 3853 genes and 465 transcription factors from 19 species. A new feature called the ‘publication queue’ allows users to input relevant papers from scientific literature as targets for annotation. The queue contains 4438 gene regulation papers entered by experts and another 54 351 identified by text-mining methods. Users can enter or ‘check out’ papers from the queue for manual curation using a series of user-friendly annotation pages. A typical record entry consists of species, sequence type, sequence, target gene, binding factor, experimental outcome and one or more lines of experimental evidence. An evidence ontology was developed to describe and categorize these experiments. Records are cross-referenced to Ensembl or Entrez gene identifiers, PubMed and dbSNP and can be visualized in the Ensembl or UCSC genome browsers. All data are freely available through search pages, XML data dumps or web services at: http://www.oreganno.org

CiteSeerX

Crossref

Ghent University Academic Bibliography

PubMed Central

The University of Manchester - Institutional Repository